Sentence-Internal Prosody Does not Help Parsing the Way Punctuation Does

نویسندگان

  • Michelle L. Gregory
  • Mark Johnson
  • Eugene Charniak
چکیده

This paper investigates the usefulness of sentence-internal prosodic cues in syntactic parsing of transcribed speech. Intuitively, prosodic cues would seem to provide much the same information in speech as punctuation does in text, so we tried to incorporate them into our parser in much the same way as punctuation is. We compared the accuracy of a statistical parser on the LDC Switchboard treebank corpus of transcribed sentence-segmented speech using various combinations of punctuation and sentence-internal prosodic information (duration, pausing, and f0 cues). With no prosodic or punctuation information the parser’s accuracy (as measured by F-score) is 86.9%, and adding punctuation increases its F-score to 88.2%. However, all of the ways we have tried of adding prosodic information decrease the parser’s F-score to between 84.8% to 86.8%, depending on exactly which prosodic information is added. This suggests that for sentence-internal prosodic information to improve speech transcript parsing, either different prosodic cues will have to used or they will have be exploited in the parser in a way different to that used currently.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Psycholinguistics Cannot Escape Prosody

Once, sentence processing research set aside prosody in order to focus on syntactic and semantic processing. Experimental sentences were mostly presented visually, often without prosodic markers such as commas. Now that we have made some progress by this ‘divide and conquer’ approach, and now that the technology for working on speech has improved, it may be time to integrate prosody into proces...

متن کامل

Punctuated Parsing: Signposts Along the Garden-Path

Although there has been some speculation concerning the role played by punctuation in parsing, there has been amazingly little empirical investigation of the issue. Punctuation appears to be a widely neglected topic. For the most part, where punctuation has been included in parsing studies, investigators have simply assumed that punctuation, such as commas, can be used to effectively disambigua...

متن کامل

Three Dependency-and-Boundary Models for Grammar Induction

We present a new family of models for unsupervised parsing, Dependency and Boundary models, that use cues at constituent boundaries to inform head-outward dependency tree generation. We build on three intuitions that are explicit in phrase-structure grammars but only implicit in standard dependency formulations: (i) Distributions of words that occur at sentence boundaries — such as English dete...

متن کامل

Commas and Spaces: The Point of Punctuation

While it has been widely assumed that punctuation may play a critical role in parsing, there has been relatively little direct empirical investigation of its effects. Most researchers have either avoided the use of punctuation or have simply assumed that it will serve a disambiguating role. There has been little or no consideration of how ’disambiguation’ might occur or whether it is equally ef...

متن کامل

Towards a Syntactic Account of Punctuation

Little notice has been taken of punctuation in the field of natural language processing, chiefly due to the lack of any coherent theory on which to base implementations. Some work has been carried out concerning punctuation and parsing, but much of it seems to have been rather ad-hoc and performance-motivated. This paper describes the first step towards the construction of a theoretically-motiv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004